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Abstract. Zurek's derivation of Born's rule using envariance (invariance 
due to entanglement) is considered to capture the probability in full gener- 
ality, but only as applied to measurement of a quantum observable. Con- 
trariwise, textbook formulations of Born's rule begin with a pure state of a 
closed, undivided system. The task of this study is to show that a rearrange- 
ment of the Zurek approach is possible in which the latter is viewed as giving 
the probabilities for Schmidt states of an arbitrary composite state vector, 
and afterwards it is extended to probabilities in a closed, undivided system. 
This is achieved by determining simultaneously probability and measure- 
ment based on the fact that the physical meaning of probability and that of 
measurement are inextricably dependent on each other. 

PACS numbers: 03.65.Ta, 03.65.Ca 

1 Introduction 

The quantum probability law or Born's rule (as it is also called for historical 
reasons) is one of the fundamental pillars of physics. All attempts to derive 
it are of value, and so is Zurek's approach. 

This article is focused on the question how one can derive, within the 
Zurek approach, probabilities for any closed, undivided system. A simple 
procedure is presented in the next section, and it is pointed out that there 
is a price to be payed for the simplicity: contradiction with laboratory ex- 
perience. In section 3 it is shown that this difficulty can be surmounted 
if one derives also nondemolition measurement of incomplete observables 
with a finite number of distinct eigenvalues simultaneously with derivation 
of probability. 

Since this study comes somewhere at the end of the Zurek campaign at 
issue, first a short review of the latter is given. 
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Zurek's derivation of Born's rule from entanglement pQ, [2] stands out 
among the similar efforts because entanglement is the generator of decoher- 
ence, which, in turn, proved to be a fundamental quantum entity [3], [I], 
[H]) [S], [7J- Zurek made use of entanglement in terms of an entanglement- 
determined symmetry. He coined the word envariance for it. (Initially it was 
short for environment-assisted invariance, e. g. [8], [2], but later it evolved 
into the better term: entanglement-assisted invariance [9].) 

The initial envariance approach [1J gave rise to a number of commentaries 
and analyses in the literature [TO] , [H] , [TO] , [TO] , [H] , including that of the 
present author [TO], followed by [TO]. Zurek himself commented [2] on the 
articles of Schlosshauer and Fine [11] and of Barnum [12] . 

The study of the present author |15] came almost at the end of this series, 
even after Zurek's second paper [2]. It was aimed at a critical rederivation. 
It began with a comprehensive theory of twin unitaries - the other face of 
envariance; it was followed by a thorough and critical derivation of Born's 
rule for Schmidt states (see definition below) from entanglement along the 
lines of Zurek's reasoning in three stages but with twin unitaries instead of 
envariance. Finally, an attempt was made to extend Zurek's theory to the 
general case in stages 4 and 5. I think now that this attempt can be done 
better. 

Incidentally, my commentaries on some of the remarks in the preceding 
articles were left out in the published paper |15| . These remarks gave rise 
to my attempts at improvement. They can be found in the archive version. 

In his latest publication (as far as known to me) [6] Zurek gives on en- 
variance only the references [11], |12] and |15j (besides his own). (Perhaps 
the inventor of the envariance derivation does not consider the rest of the 
articles on the subject as useful contributions.) 

My argument in [15] for going beyond Zurek's result was based on ideal 
measurement and the stipulation that probabilities are predictions for the 
statistical weights of definite-result sub-ensembles in measurement (cf the 
passage between relations (22) and (23) ibid.). But this same stipulation (cf 
the last passage in [17] ) with ideal measurement was subsequently shown to 
be by itself sufficient to derive the probability law. 
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2 Closed, undivided system 



A simple derivation is presented in the first subsection. It is critically ex- 
amined in the second subsection. 

2.1 Derivation 

The result of Zurek's argument can be shortly put as follows. 

Zurek's envariance theorem Let | *S>)i2 be an arbitrary state vec- 
tor of a composite (1 + 2) system in which the subsystems are entangled. 
(The latter requirement is no more than ruling out a tensor product of sub- 
system state vectors.) Then, if it is written in the form of a biorthogonal 
decomposition (so-called Schmidt decomposition) 

I^)i2 = ^cti \i)i \i) 2 , Vi : cnj £ C, 

% 

Mi,i' : = (i b I «'>2 = (1) 

(this can always be done, though in general non-uniquely) , the probabilities 
of the Schmidt states {\i)\ ■ Vi} , or, more precisely, of the events (projec- 
tors) {| i)\(i |i: Vi} , are {|«i| 2 : Vi} coinciding with the values given by 
Born's rule. 

Remark The result is obviously symmetric in the two subsystems. Namely, 
it is an arbitrary, subjective decision which subsystem we write as first and 
which as second. The properties of the subsystems displayed in (1) are sym- 
metric under exchange of the latter. 

Actually, put in this way, Zurek obtained Born's rule only for Schmidt 
states. As a justification, entanglement due to measurement-like interaction 
is invoked suggesting that the environment can be replaced by a measuring 
instrument or that the latter can be part of the former. 

Let us take a different view of Zurek's approach. Let | (ft) be an 
arbitrary state vector of a simple, i. e., undivided, or a composite closed 
system. ('Closedness' is meant in the sense of lack of quantum correlations 
with the environment.) Let, further, 

A = J2 a k\ak)(ak\, k^k' => a k ^a k >, (2) 

k 

be an arbitrary complete observable in spectral form. (Completeness of an 
observable means that no eigenvalue is degenerate.) 
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We write down the expansion of \(f>) in the eigen-basis of the observable: 

\4>) = ^4>k I a*)- (3) 

k 

To derive probability for any case that is more general than the scope 
of Zurek's envariance theorem, one must add some new requirements to en- 
variance. But the adjoined requirements must fit naturally with probability 
and entanglement. Since both these concepts are purely mathematical un- 
less one falls back on measurement, we have thus a natural entity to consider 
along with envariance. 

One should have in mind that in any Copenhagen-inspired interpreta- 
tion of quantum mechanics measurement consists of the dynamical unitary 
evolution in which the interaction of object and measuring instrument is 
contained - called premeasurement [18] - and of collapse. In a no-collapse or 
relative-state approach measurement consists only of premeasurement. All 
the connection with probability is in premeasurement. Henceforth the terms 
"measurement" and "premeasurement" are used interchangeably. 

Let us take for our premeasurement the simplest one: that of ideal mea- 
surement of a complete observable A (cf (2)), and let us assume validity 
of Zurek's envariance theorem. The first step in the determination of pre- 
measurement is the requirement of the so-called calibration condition (CC) 
(see my comment on this in the next subsection): 

Vfc : \a k )i \ X °h -> l«fe)i lx fc >2, (4) 

with | x°)2 as the initial (or ready-to- measure) state of the measuring 
instrument, and {| x k )2 '■ VA;} as the orthonormal pointer states. (All 
first-subsystem entities appear with the index 1 when dealing with the 
composite system-plus-instrument system.) 

On account of expansion (3) and linearity of the evolution operator, this 
entails 

1 0>l [X°>2 -> 5> fc |a fc )i \x k h- (5) 

k 

The final composite state vector of premeasurement appears in (5) in a 
biorthogonal expansion. This is a consequence of the fact that {| at)i ■ 
VA;} and {[ x k )2 '■ V/c} are orthonormal eigen- vectors of the measured 
observable and the pointer observable (which is not explicitly made use 
of) respectively. Hence, Zurek's envariance theorem and the Remark imply 
that {!</>£; | 2 : VA;} are the probabilities of the corresponding pointer states 
{\x k h:Vk} . 
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Measurement applies to individual systems, but also to ensembles of 
these. The latter fact requires a second step in defining measurement, to stip- 
ulate the following: The probabilities of the pointer states {| \ k ) '■ ^k} *n 
the final composite state (5) that results from premeasurement, should equal 
those of the corresponding measured results {a^ : VA;} or {\ a^) : VTc} 
in the measured state \ (ft) . This is the so-called probability reproducibility 
condition (PRC, cf p. 28 in [IH]). 

It follows that we must require the probabilities {|</>fc| 2 : VA;} to be 
simultaneously also the sought-for probabilities of the corresponding results. 

This simple indispensable requirement achieves the goal to extend the 
scope of Zurek's envariance theorem to undivided closed systems. 

2.2 Critical remarks 

Measurement theory assumes knowledge of the probability law. Since we are 
deriving the latter, we must examine critically the concepts of measurement 
theory utilized in the above derivation. It goes 'from scratch', by which I 
mean that we assume the quantum formalism less the probability law and 
lacking the definition of measurement. One should note that hereby two 
important concepts are assumed to be valid: 

(i) State vectors are taken to represent pure states; more precisely, ho- 
mogeneous ensembles of equally prepared quantum systems. 

(ii) The so-called eigenvalue-eigenstate (e-e) link is assumed. It says that 
if an observable (Hermitian operator) A , a state vector | <j)) , and a real 
number (eigenvalue) satisfy an eigenvalue relation A \ (p) = a& | <fi) , 
then and only then the observable has the definite value a& in the state 
at issue. By 'definite value' one means a non-probabilistic concept: each 
individual system in the ensemble represented by | (f>) has the value a& 
of A . 

Naturally, the CC is based on the e-e link. As to PRC, it may have 
meaning even without a known form of the probability law because it re- 
quires only sameness. 

Perhaps a serious criticism against the derivation in the preceding sub- 
section is the remark that ideal measurement of a complete observable is 
actually impossible in the physical laboratory. 

Such measurement requires a measuring instrument with infinitely many 
pointer states {| \ k ) '■ k = 1, 2, . . . , oo} due to the infinite dimensionality 
of the state space of a real system. Such an instrument does not exist. In 
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actuality one can measure only observables with a finite number of distinct 
eigenvalues, and one does this with the help of a pointer observable with 
just as many pointer entities. 

The project of this study is to show that Zurek's approach allows one to 
overcome this difficulty. 



3 Joint derivation of probability and nondemoli- 
tion measurement. 

We now set the task to derive probability and measurement together from 
scratch. We take an incomplete observable A , i. e., we allow each of the 
eigenvalues a n of A to be arbitrarily degenerate. Then the spectral form 
in terms of the eigen-projectors {P n : Vn} of A reads: 

A = Y^ a n P n , n^n' => a n ^ a n >, (6) 

n 

and the completeness relation J2 n P n = ^ is valid, where / is the identity 
operator. 

The first necessity for measurement is a measuring instrument provided 
with a pointer observable B , the eigenvalues b n and the corresponding 
eigenprojectors Q n of which will serve to indicate the results of measure- 
ment. One has the spectral decomposition B = J2 n b n Q n with J2n Q n = 
I . (One utilizes the same index n as in (6) because a one-to-one relation 
between the eigenvalues of the measured observable and the pointer observ- 
able must have been established.) The measuring instrument is, further, 
assumed to be equipped with an initial (or ready-to-measure) state | x°) . 

The measurement takes place in the form of the evolution (premeasure- 
ment) 

U 12 (\4>)l |X°) 2 ) =|*}i2 = E™2 ( 7 «) 

n 

where the composite-system term vectors are defined as follows: 

Vn: \W2 = Q2 I*)i2 (76) 

(unnormalized vectors are over lined). 

The vectors | )™ 2 are 1-eigen-vectors of Q% and simultaneously 
the 6 n -eigen-vectors of the pointer observable B unless they are zero. 
(We write shortly instead of 1\ ® Q2 ■) 
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The probability reproducibility condition (PRC) was introduced in sub- 
section 2a, and it was pointed out that it is essential for the definition of 
ensemble measurement. In the present more general case it requires that 
the probabilities of the pointer events {Q n : Vn} in the composite state 
| v I / )i2 (cf (7a)) should equal those of the corresponding measured results 
{a n : Vn} or, equivalently, of {Pf : Vn} in the measured state 1 4>) . 

To establish (7a) as premeasurement, we must require also the calibration 
condition (CC) to hold. Let us start with it. 

We want to express the claim that if the event P" is certain (in the 
absolute, i. e., individual-system-in-the-ensemble sense) in an initial state 
\4>)i , then so is Q% m the final state |^)i2 (cf (7a)). 

The e-e link allows us to write the initial state in the CC in measurement 
in the form 

\4>)i = Pi n k)i- (8a) 

The entire CC then reads: If the initial state of the system is such 
that (8a) is valid, then the measurement evolution (7a) gives a final state 
satisfying 

|*)i2 = Q 2 l |*)i2- (86) 

We confine ourselves to nondemolition (synonyms: predictive, repeat- 
able, or first kind) measurement in which, by definition, if the measured 
observable A has a definite value a n in the initial state | cp) , it is not 
demolished,!, e., the resulting composite-system state \^}i2 (cf (7a)) still 
is an a n -eigenvalue state of A\ . 

If one has individual systems in mind, then the omitted - demolition, 
retrodictive, nonrepeatable or second-kind - measurements are not satisfac- 
tory, though they are more frequent in the laboratory, because one cannot 
verify after measurement the just obtained result on the measured individ- 
ual system for every result. 

Incidentally, this investigation bears some resemblance to Zurek's article 
[19] . But both the details and the purposes differ. 

To make the presentation more transparent, we henceforth enumerate 
those values of the index n for which the term vector | x &)™ 2 m (? a ) is 
non-zero by k . 

Thus, in nondemolition measurement one joins the following relations 
to relation (8b) 

Vfc: WK=Pi'\W2- (9) 
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Finally, starting with an arbitrary state vector \ <fi) , when coupled 
to the initial state of the measuring instrument and when nondemolition 
measurement is performed, one obtains 

^i2(|0)i |x°> 2 ) =U 12 [(j2P 1 n \<P)i) |X°} 2 ] =EM (10a) 

n k 

(the zero terms in the last expression are omitted, and this is emphasized 
by the change of the enumerating index), so that 

VA; : jtt>£ = = (106) 

and 

vfc: m 2 = (HiPi k ml /2 m k i2, 

where |^)i2 is the corresponding unit vector. 

Relation (10c) is due to the fact that the unitary evolution does not 
change the norm of any term P™ | cj))i \ x°) 2 m (10a). (Now we see that 
the omitted zero terms are those for which (<f)\i P™ \<p)i = ||P" |^)i|| 2 = , 
i. e., those n- values for which P™ takes \4>)i into zero.) 

The terms \^)\ 2 i n t ne fi na l state in (10a) are biorthogonal due to 
(10b). When one writes down any Schmidt decomposition of each state 
vector I ^>)\ 2 = J2i 4>M I kl)i \ kl) 2 , and one makes use of (10c), one arrives 
at 

*)i2 = E 1 *>i2 = X>li p i 1 ^i 2 E fa \ kl h 1 kl h- ( n ) 

k k I 

Here one has {kl\i\kl)\ = S k j.6 l j= (kl\ 2 \kl) 2 , i. e., one obtains a Schmidt 

decomposition of the entire composite final state \^)i 2 = J2k I ^)i2 • 

Now Zurek's envariance theorem (with the Remark) are applicable. They 
say that the sought-for probability of each Schmidt state | kl) 2 in the final 
state I*) 12 = Efc' I^)i2 is given by the expression ((</>|i Pf \(f))^j \(f> M \ 2 . 

To be able to avail ourselves of the PRC, we need to evaluate the proba- 
bility of Q\ in Y^k' 1^)12 for each k value. In order to use the obtained 
probability values of the Schmidt states \kl) 2 , we need to realize that 

Q2E \kl)2{kl\ 2 =Y, \klh(kl\ 2 , 
1 1 

or symbolically ^J2i I kl) 2 (kl \ 2 ^j < Q 2 (a sub-projector relation; proof 
is given in Appendix A). Hence, there exists a possibly nonzero projec- 
tor (Q 2 )' that is orthogonal to the sub-projector, and such that Q\ = 
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(Ej I kl) 2 {kl | 2 ) + (<$)' ■ But (E/ I kl) 2 {kl | 2 ) , just like , takes 
Efe' I^)i2 into the corresponding term l^)^ implying that (Q 2 )' , even 
if it is nonzero, acts as zero on J2k' \^)i2 ■ 

At this stage it seems unavoidable to introduce the additivity assumption: 
If a projector is an orthogonal sum of projectors P = J2 m Pm j then the 
probability p(P) of the corresponding event is the sum of the probabilities 
of the term events p(P) = ^2 m p{Pm) in any state. 

Since we need to allow our event (projector) Pf to have an infinite- 
dimensional range, we must require the equality to hold true even if the 
projector sum has an infinite number of non-zero terms. This is called a- 
additivity (it implies additivity). 

I can offer only intuitive plausibility support for the u-additivity stip- 
ulation. It starts with pointing to the fact that every event has an opposite 
event (every projector has its ortho-complemnentary event). But, in order 
to avoid perhaps unnecessary repetition, I ask the reader to read my detailed 
explanation in subsection V.E. of my former article [15] on the envariance 
derivation. 

On account of Zurek's envariance theorem and cr-additivity, we ob- 
tain that the probability of the event (projector) ( J2i \ kl) 2 (kl \ 2 in 

Efe' I^)i2 i s P\ as seen utilizing (11) and the probabilities of 

the Schmidt states \kl) 2 in (11). 

The probability of Q\ in J2k' I ^)i2 i s > 011 account of additivity, that 
of (El I kl) 2 (kl | 2 ) plus that of (Qf)' , which is zero. Hence also the 

probability of Q\ is in Efc'l^)i2 equal to (<^>|i Pf 1 4>)i ■ 

Finally, invoking the PRC, we obtain P^ \(f>)i as the probability of 
the eigen-event P^ in the state \<p)i . Thus, Born' 's rule for an arbitrary 
state vector and an arbitrary event is derived. 

Further generalization is straightforward. For the reader's convenience, 
it is presented in Appendix B. 

In view of the fact that every real (in the sense of laboratory-realizable) 
measuring instrument must have a finite number of distinct pointer events 
{Q 2 } ) one can make the following comment. If at least one of the eigen- 
values a n of the measured observable A is infinitely degenerate, then, 
in spite of the infinite dimensionality of the state space, the index n may 
take a finite number of values . This is why we are dealing with measure- 
ment that is, in principle, realizable in the laboratory. 
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4 Concluding remarks 



In the above, perhaps cumbersome, but essentially straightforward inter- 
twined derivation of measurement and probability we have seen that for 
deriving the former, the CC and the PRC seemed to be unavoidable as 
crucial partial determinations of the measurement concept. (Measurement 
becomes completely determined when so is probability.) 

The e-e link and cr-additivity were inescapable for the project of deriving 
probability. 

In every derivation of probability one is worried about possible circular- 
ity: could some assumption(s) be equivalent to the entire or to a part of the 
probability law?! 

Let me turn to a deep critical observation in the article of Schlosshauer 
and Fine In their Concluding remarks they say: 

"...a fundamental statement about any probabilistic theory: We 
cannot derive probabilities from a theory that does not already 
contain some probabilistic concept; at some stage, we need to 
put probabilities in to get probabilities out." 

At this place we must recall Gleason's theorem [20], which says that a 
u-additivity observing probability function on the entire lattice of events is 
equivalent to a general quantum state (and vice versa) via the trace rule 
tr(Ep) , where E is an arbitrary projector and p is an arbitrary density 
matrix. Gleason's fundamental theorem makes cr-additivity the reason for 
circularity in the sense of the above quote. 

One should not hold this insight against Zurek's approach because it is 
not a competitor to Gleason's theorem; it is a complementary effort. 

The derivation of the probability law from entanglement and additional 
assumptions serves the purpose to shed light on the 'miracle' how this law 
comes about. One does understand, at least mathematically, how super- 
position of tensor-product (hence uncorrelated) state vectors gives rise to 
entanglement. Zurek's envariance derivation project shows, at least to a 
large extent, how this brings about the trace rule (equivalent to Born's rule) 
via its envariance symmetry. 

Finally, let me point out that Zurek, who launched the envariance project 
to which this study attempts to be a modest contribution, is going on pre- 
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senting new ideas within the project [21| . 
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Appendix A 

Lemma Let | ^}i2 = J2i a i I Oi I O2 be a Schmidt decomposition of 
a bipartite state vector, let p2 = tvi( \ ^)i2(^ 1 12 J be the reduced den- 
sity operator of the second subsystem, and let Q2 be a second-subsystem 
projector such that Q2 I ^)i2 =| ^}i2 - Then Q 2 J2i I ^2^ | 2 = J2i 1 1)2(1 12 , 
symbolically | Z> 2 <Z | 2 ) < Q2 ■ 

Proof Q2P2 = tri (q 2 |*> 12 <*| 12 ) = P2- As well known, any Schmidt 
decomposition is accompanied by pi = J2i r i I ^2(^2 , where 77 (^ = \ai\ 2 ^j 
are the positive eigenvalues and | 1)2 are the corresponding eigenvectors 
of p2 ■ (We assume that the above Schmidt decomposition exhibits only 
non- zero terms.) Hence J2i ,r i'Q2 \l')2(l' b= J2i" r i" I 1"}2{1" | 2 • Taking 
the (I {2 ••• |Z)2 matrix element, one obtains {l\2 Q2 IO2 = 1 • From this 
Q2 1 2 =102 and the claimed relation ensue. □ 

Appendix B 

Completion from pure state to general state. 

Let us rewrite the obtained expression of the probability in a form suit- 
able for further generalization. Let P be an arbitrary event (projector). 
Then, we have derived that its probability in a pure state 1 4>) is 

p(P,\<P)) = (<P\P\<P)=tr(P\4>)(4>\) (B.l) 

Let p be a density matrix representing a proper mixed state. Let, 
further, 

P = ^2 w k (B.2) 

k 

be one way of writing the state as a mixture of pure states. The physical 
meaning of (B.2) is that p , as an ensemble, can be prepared by mixing 
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the pure states in (B.2) in the proportions given by the statistical weights 
w k . More precisely, the numbers N k of the individual systems making 
up the pure sub-ensembles \(p) k ((j)\ k relate to each other in the same way 
as the w k do. Hence, 

Vfc: w k = N k /N, N = J2N k ,. (S.3) 

k' 

Let us imagine that P is measured on each individual system in the 
ensemble p , and that the event occurs on N k systems in the k-th. sub- 
ensemble, altogether on J2k ^k systems in the entire ensemble. Since the 
probability applies to a random individual system, the sought for probability 
is EkN k /N (cf (B.3)). Thus, 

p(P,p) = Y,N k /N = Y, (N k /N) (N k /N k ) = ^> fe tr(P \</>)(</>\) = tr(Pp). 

k k k 

(BA) 

Evidently, all the previous relations (B.1)-(B.3) have been utilized. 

If p is a density matrix representing an improper mixed state, then, 
by definition, there exists another system with which our system is entan- 
gled, and together they are in a pure state. Denoting our system by index 
1 , the other system by index 2 , and the state of the (1 + 2) system by 
| ^)i2 , the improper mixed state is determined by the partial trace 

Pl =tr 2 ( |*}i 2 (*|i 2 ). (B.5) 

Making use of (B.4) and (B.5), it is straightforward to evaluate the 
probability p(P\,pi) of an arbitrary first-subsystem event Pi . 

p(Pi, pi) = tn 2 (Pi(| *)i 2 (* |i 2 )) = tri[Pi(tr 2 (| *}i2<* I12)] = tri(Pi^i). 

(B.6) 

(Note that indices are used unnecessarily also for full traces for trans- 
parency.) 

We have obtained the result that the formalism does not distinguish 
proper and improper mixed states. 

Naturally, all claims made in this Appendix are standard reasoning. 
They are required to complete the derivation. 
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